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Abstract 


The  dynamics  by  which  a  stationary  time  series  produces  its 
zero-crossings  and  higher  order  zero-crossings  sequentially  in 
time  is  studied,  illustrated,  and  applied  in  white  noise  testing, 
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1.  Introduction 


Consider  a  stationary  time  series  evolving  in  time  about 
level  zero.  As  the  time  series  threads  its  sample  path  about  this 
level  the  zero-crossing  rate  (the  number  of  zero-crossings  per 
unit  time)  converges  in  some  sense  as  the  series  length  increases. 
Similarly,  the  zero-crossing  rate  in  the  first  difference  of  the 
series  converges  too,  and  the  same  can  be  said  about  the  zero- 
crossing  rates  of  higher  order  differences  of  the  time  series.  In 
this  paper  we  examine  the  dynamic  process  by  which  a  time  series 
and  its  higher  order  differences  produce  their  respective  zero¬ 
crossing  counts  sequentially  in  time  for  the  purpose  of  white 
noise  testing.  Because  a  zero-crossing  rate  as  a  function  of  the 
series  length  is  a  stochastic  process,  we  can  reiterate  by  saying 
that  we  shall  study  zero-crossing  rate  processes  obtained  from  the 
original  time  series  by  repeated  differencing.  Such  processes 
will  be  called  higher  order  crossings  rate  processes  or  simply 
HOC-processes.  The  present  work  is  an  extension  of  the  methods 
and  ideas  discussed  in  Kedem  (1987)  and  in  Kedem  and  Martin  (1987) 
where  HOC  plots  and  HOC-processes  are  introduced  as  graphical 
tools  in  time  series  model  identification.  Here  the  emphasis  is 
on  a  graphical  tool  that  is  more  powerful  than  HOC  plots  for  white 
noise  testing. 

There  is  a  difference  between  regular  higher  order  crossings 
(HOC)  and  HOC-processes.  HOC  are  zero-crossing  counts  in  filtered 
time  series  where  the  filtering  may  consist  of  any  combination  of 
linear  filters  applied  sequentially  or  repeatedly.  The  distinc¬ 
tion  lies  in  the  sample  size.  HOC  are  defined  for  fixed  record 


w*  n«  WM  n  -  -  »  - 


lengths  while  in  HOC-processes  the  record  lengths  increase.  In 
this  paper  by  HOC  we  always  mean  higher  order  crossings  obtained 
by  repeated  differencing.  General  properties  of  HOC  are  reviewed 
in  Kedem  ( 1986) . 

In  general  HOC  constitute  a  monotone  increasing  sequence. 

This  in  turns  implies  that  the  <k+l)'th  order  HOC-process  tends 
C‘,  nave  sample  paths  that  lie  above  the  sample  paths  of  the  k'th 
order  HOC-process,  a  fact  which  induces  a  certain  weak  convergence 
of  HOC-processes.  Thus  there  are  two  basic  modes  of  convergence 
associated  with  HOC-processes,  "horizontal"  that  depends  on  the 
series  size  and  "vertical"  that  depends  on  the  difference  order. 
Both  modes  of  convergence  summarize  the  oscillatory  behavior  of  a 
time  series.  The  initial  rate  of  the  vertical  convergence  is 
rather  fast  which  means  that  very  few  HOC-processes  are  needed  for 
our  purpose . 

The  first  part  of  the  paper  deals  with  basic  properties  of 
HOC  and  HOC-processes.  Section  4  discusses  in  some  detail  the 
difficulty  in  obtaining  the  exact  variance  of  HOC  and  suggests 
instead  a  useful  approximation.  Section  5  is  where  we  describe 
and  illustrate  our  test  procedure. 


2.0  DEFINITION  OF  HOC  PROCESSES 

We  are  given  a  time  series  {Z^},  t  =*  ,  ••*,  -1,  0,  1, 
with  covariance  function 


/  ( t ,  r )  =  Cov ( Z  ,  Z  ) ,  r,t  =  0,±  1,±  2, 
t  t  -  r 


and  correlation  function 


P( t.r)  =  r (t ,r) /r {t ,0) ,  r,t  =  0,±  1,±  2,  ••• 

We  assume  the  process  is  stationary  with  zero  mean,  so  that 


=  E  Zt  =  E  =  E  Zt+J 


r(t,r)  =  E  {Zt  Zt_r>  *  rv 


=  rr  /  r0‘ 


r  =  0,  ±1,  ±  2 ,  •  •  •  .  Finally  we  assume  that  the  process  {Z^.}  is 


Gaussian. 


Let  { z|° ^ }  =  {Zt>  be  the  0th-order  difference  process.  To 
form  the  higher  order  difference  processes,  we  begin  by  defining 
the  first  order  difference  process  (zj.1^)  such  that 

(1)  _  (0)  _  (0) 

Zt  “  Zt  Zt-1  V  V 

In  general  for  the  k^-order  differences,  define 


z(k+l)  =  z(k)  .  z(k)  .  v(k+1)2tf  k  =  0flf  2( 


(2.1) 


The  processes  (Z^.  )  are  referred  to  in  the  remainder  of  this 

paper  as  "Z-processes" . 

From  the  k-lth-order  difference  process  z|k  ^  we  define 
the  kt^-order  "clipped"  process  (X^k^ }  such  that 


xjk)  =  I  (z[k  1]  >  0) ,  t  «  0,  il,  i2, 


(2.2) 
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where  I(*)  is  the  indicator  function.  The  binary-valued 
(k) 

processes  {X^  }  are  referred  to  as  "X-processes" .  We  further 

define 


(2.3) 


( k )  ( k-1 ) 

Thus  d^.  =  1  whenever  a  zero-crossing  occurs  between 

( k-l ) 

and  Z^  ,  and  is  0  otherwise. 

The  total  number  of  zero-crossings  which  occur  between  t  =  1 
and  t  *  N  in  the  ( k-l ) *^-order  process  is  then 


Vn  -  d2  +  d3  ~  —  • 


(2.4) 


The  processes  (D^  ^/N)  are  referred  to  as  HOC-processes  or 
simply  "D-processes" ,  with  k  being  the  order  and  N,  N  >  2,  the 
number  of  observations.  Thus  the  first  order  HOC-process  is 
{ D i  n/h),  N  =  2,3,-..  .  More  precisely,  (D^  N/N)  18  a  sequence 

of  processes  with  parameter  N  indexed  by  k. 


3.0  PROPERTIES  OF  D-PROCESSES 


3.1  ACF  relationships 

Because  the  process  is  obtained  from  simple 

differencing  of  the  process  (Zj.  ),  there  is  a  direct  relation¬ 
ship  between  the  correlation  functions  of  the  Z-processes. 

Let 


=  covariance  function  for  {z|k^ } 


(k)  - 


=  correlation  function  for  (Z^.  }. 


From  the  definition  of  {Z| 


Zik’  'iSo  IM)1  [9  Zt-1  11 


so  that 


(k) 

=  E  < 

rz(k)  z(k) 

r 

t  t  t-r 

(k) 

r 

k 

k 

=  E 

i  _ A 

E  (-D 

J  _  A 

Jo  Jo  K  (5)  E<Z-J  Zt-r-i  1 


Rearranging , 


'0°  |  g)  ri0U 


i- j=m 
0<  is  k 
0<  jsk 


We  know  that  for  positive  integers  a,  b,  n 


to]  0  *  0  w . 0  0  -  m 


and  we  observe  that 


■  E 

i-  j=m 
Os  i<  k 
Os  j<  k 


0  0  *  0  0  *  (-M . 0  P 


a 


0-1) 


.  *.  v*,  -  .  a  :*.  .  '  >  .*  /•  «“*V* V'j,*’  A/’j 


inumjr-irw  ^  ^ 


■  («y  o  •  (,-:.,)  („y  ••■■•  is]  (,y 

■  e*j 


Therefore , 


<k)  _ 


'  (?)  'i0>  ♦£  (?.)  «-*>"  ('i!i  *  '£) 


and  we  can  obtain  an  expression  for  p^  '  setting  r  *  1 


,(k)  _ 


f2k  V  .  f2k'l^f2k  1  „  .  f2k  1  f2k  1  .  ^  ,  .k 

k-1  pl(  k  k-2  p2(  k-1  k-3  1  (  1}  pk+l 


[kk]‘2pl  [kkl]+2p2  [k-j-^a^-J 


(3-2) 


+(-l)  2p. 


For  r  2  1,  we  can  obtain  from  the  basic  relationships 


(k+1)  7 ( k+1 ) 

(k+i)  =  *  t  ^t-r  ' 

E  {  (Z<k+1))2  ) 


E  {  (Z<k)  -  Z<k{)  <Z<k>  -  Z^j)} 
E  {  <z'k)-  Z^kj)2  ) 


so  that 


o  v(k)  _  (k)  _  (k) 

rr _ 'r  +  l  'r-l 

o  v ( k ) _  9  (k) 

r0  r  1 


3 


3.2  Expected  Values  for  D-Processes 


When  the  underlying  proces  {Z^}  Is  Gaussian,  the  expected 
number  of  zero  crossings,  E  can  be  evaluated  directly  as  a 

function  of  p^k  1 ^ ,  the  first  correlation  coefficient  of  (z|k  1^). 
We  observe  that 


and  that 


,(k)  =  ,y(k)_  (k)  2 
t  (Xt  xt-lJ 


E  d 


(k)  _  „  . , Y ( k)  y(k) . 2 

t  E  {(Xt  Xt-l' 


=  E  {(x|k))2  +  (X^kj)2  -  2  x|k)  x|kJ  ) 


=  E  (X<k)>  +  E{x|kJ}  -  2  E  {x[k)  x|kJ  }  (3.4) 


since  the  x-processes  are  0/1-valued. 


The  assumption  that  (Z^)  is  zero-mean  Gaussian  implies  that 

(k) 

(Zt  }  is  also  zero-mean  Gaussian,  k  =  1,  2,***,  since  from 
(2.1)  the  higher  difference  processes  are  linear  combinations  of 
the  O-order  process .  Then 


E  (x|k) )  =  P  (xjk)  =  1) 


=  1/2 

and 

E  (x{k)  x[k}>  =  P  {(zjk_1)  i  0)  and  (Z^"1*  *  0)) 

=  1/4  +  (1/2 n)  sin"1  p|k_1)  (3.5) 


Thus  we  have 

E  d|k)  =  1/2  -  (1/w)  sin-1  P*k_1 


.»  •it.'-li  *l4. 


E  DkN  =  (N-l)  E  d<k» 

=  (N-l)  (1/2  -  (1 /n)  sin-1  p[k_1)]  (3.6) 


(k-D  _ 


n  ED 


f  k,N  1 

l  N-l  J 


0.7) 


Values  of  E  ^  for  selected  processes  with  N=1000  are 
shown  in  Tables  4-4  A,B,C,D. 

The  relationship  (3.7)  together  with  (3.2)  and  (3.3)  indica¬ 
tes  that  the  expected  zero-crossing  rates  are  uniquely  determined 
by  the  correlation  structure  of  the  underlying  Z^0^-  process. 


3.3  Convergence  of  HOC-processes 

It  is  instructive  to  consider  the  graphs  of  some  HOC- 
processes.  Figures  3. 1-3.3  show  the  graphs  of  several  HOC- 
processes  obtained  from  both  staionary  as  well  as  nonstationary 
time  series.  The  firs.;  striking  observation  is  that  the  HOC- 
processes  seem  to  converge  monotonically  to  a  limiting  HOC-process 
and  that  the  convergence  is  rather  fast.  This  is  the  vertical 
convergence  mentioned  earlier,  and  one  that  has  been  observed  in 
numerous  cases  as  already  noted  in  Kedem  and  Martin  (1987).  The 
horizontal  convergence  is  observed  in  the  figures  at  least  in  the 
stationary  cases.  We  shall  now  discuss  these  observations  more 
formally  below. 


A  basic  fact  which  underlines  the  vertical  convergence  is 


that 


Dk , N  5  Dk+ 1 , N  +  1 ‘ 


(3.8) 


.  ...  ^  V  V  j  -•  / 
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From  this  inequality  we  can  show  that  for  strictly  stationary 
processes  the  expected  HOC  constitute  a  monotone  sequence. 


WN  HOC  PROCESS (  0 1 -08  ) 


UN  HOC  PROCESSC  0 l  08  ) 


i  H  -j 


H  4  '  W 


I 


I  ''! 

HO0  '  OOl' 


-r- 

*00 


~r  ■ 

GOO 


Figure  3.1 
series . 


HOC-processes  obtained  from  two  white  noise  time 


Theorem  3.3.1.  Let  {2  }  be  a  strictly  stationary  process.  Then 


for  every  N 


Proof:  Define 


E  °k,N  5  E  Dk+1 , N  '  k  1,2 ' 


Then 


\Jk)  =  P(x[k)  =  l|xjkJ  =  1)  .  P(k)  =  P(xjk)  =  1) 


E  DkN  =  2 ( N-l )  p , 


and  together  with  (3.8 


2(N-l)p(k) (l-\Jk) )  2(N-l)p(k+1) (1-A jk+1) )  1 

FT!  <  N-l  +  ITT 


or,  as  N  — ►  oo 


2p(k> (l-A <k> )  s  2p(1"1,(l-X<k+1)). 


Therefore 


E  DkjN  =  2  ( N-l  )p(k)  (  l-\jk)  )  <  2(N-l)p(k+1)  (1-A.jk+1>  )  =  E  Dk+1  N 


Theorem  3.3.1  obviously  covers  stationary  Gaussian  processes,  but 
it  is  only  a  first  step  in  the  explanation  of  the  vertical  con¬ 
vergence.  For  a  general  result  we  may  appeal  to  the  theory  of 
point  processes  as  outlined  in  Kallenberg  (1976). 

It  is  convenient  to  restrict  attention  to  the  set  [0,ao) 
equipped  with  the  Borel  field  @.  Let  (B  be  the  class  of  subsets 
consisting  of  all  bounded  sets  in  ®.  The  set  of  all  locally 
finite  measures  that  are  nonnegative  and  Integer  valued  is  denoted 
by  H.  Assume  that  the  process  (Z^.)  is  defined  over  the 


^  ^  -» 


probability  space  (Q,4,P)  and  introduce  in  M  the  a-algebra  7 


generated  by  the  mappings  //  — >  //(B),  (J  e  M ,  B  e  B.  By  a  point 
process  on  [0,®)  we  mean  any  measurable  mapping  from  (0,4, P)  into 
(M ,7 )  .  With  an  obvious  extension  of  the  previous  notation,  let 
D^fB) ,  B  e  B,  be  the  number  of  zero-crossings  generated  by 
in  B.  Then  for  each  realizations  of  {Z^}  >  D^*) 
defines  a  measure  in  >  and  so,  for  each  fixed  k,  )  is  a 

point  process.  The  vertical  convergence  can  be  explained  with  the 
help  of  the  sequence  of  point  processes  {D^(*)>-  We  need  the 
following  combinatorial  lemma. 

Lemma  3.3.1.  For  any  process  (Z^)  there  exists  a  tQ ,  l5tQ<N 
and  tQ  depends  on  j,  such  that  for  every  N 

I  d<J>  <  l  +  ,3.9, 

t*l  t=l  ^ 

t-to  *-*0 

with  probability  one. 


Proof :  Fix  j.  Suppose  dj^+«**+d^^  =  0;  then  we  are  done. 

So  suppose  dj^+**»+d^^  >  0.  Then  there  exists  at  least  one 
symbol  change  in  (xj*^),  l<tsN.  Note  that  in  general  we  have  the 
implication 


(X 


( j ) 

t-1 


x[j) ) 


{x|j+1) 


=  {X 


(3) 


It  follows  that  there  exists  t^ 

x(  3  +  U 

'i 

Therefore  each  symbol  change  in 
symbol  change  in  for 


such  that 


(X^*^)  produces  at  least  one 
t  <t«N.  On  the  other  hand  by 


L« * vivi  iVajviVLilLi!! 


going  from  j  to  j+1  we  may  lose,  by  (3.8),  at  most  one  zero- 
crossing  for  lststj.  It  follows  that  there  exists  a  tQ  which 
depends  on  J  such  that  (3.9)  holds.  c 

The  lemma  says  that  in  the  binary  sequence  (x|^+1^)  ,  IstsN, 
there  are  N-l  locations  that  give  at  least  as  many  symbol  changes 
as  the  corresponding  locations  in  (X^^  }  ,  IstsN,  while  no 
information  is  available  about  the  remaining  location  which  we 
denote  by  tQ.  With  the  help  of  the  lemma  we  can  now  prove  the 
weak  convergence  of  the  sequence  of  point  processes  (D^(*)). 


Theorem 


3.3.2.  Let  (Z^)  be  a  strictly  stationary  process  and 


let  D ( • )  be  the  point  process  corresponding  to  the  limiting  HOC- 
process.  Then  Dk(-)  ■*  D(')  with  respect  to  the  vague  topology 
on  H . 

Proof :  For  any  Bel  we  can  have  at  most  a  finite  number  of 

integer  points  and  thus  at  most  a  finite  number  of  zero-crossings 

1C 

by  (V  Zt)  for  any  k.  Therefore 

lim  limsup  P(Dk(B)>r)  =  0 
r-K»  k-*» 

Next,  by  Theorem  3.3.1,  for  any  I  =  (a,b],  a,b  finite,  we  have  by 


monotonicity 


E  Dk(I)  — >  E  D ( I ) ,  k  — >  ®. 


Third,  consider  the  higher  order  crossings  N  and  Dk+^  jj. 

Then  by  the  lemma  there  exist  tQ,  and  D^+1  such  that 


D  =  D  1  +  d  ^  k ) 
k , N  k  tQ 

D  =  D 1  +  d  ^ ) 

k+l,N  k+1  tQ 


*»  .**  .  **  .**  .  *  m  *  -* 


rjj‘ 

Ptjf  i 
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Dk  5  Dk-u  surely. 


By  strict  stationarity  and  since  /  is  binary 


Therefore 


pK’ 4  *) 5  pK+1)  2 

p[Di +  * *] s  p(°k+i +  dJ^u !  >) 

p(Dk,H  -  °)  8  p [Dk+ 1 , N  ■  °) 


and  so  the  sequence  P(D, 


0)  converges  monotonically  as  k 


for  any  N.  By  appeaiing  to  Theorem  4.7  in  Kallenberg  (1976)  it 
follows  that  D^*)  converges  in  distribution  to  D(*). 

For  the  horizontal  convergence  we  require  ergodicity. 

Theorem  3.3.3.  Let  ( }  be  an  ergodic  zero-mean  stationary 
Gaussian  process.  Then  for  every  k,  k*l,2,***, 

. .  Dk, N  1  -1 .  ( k-1 ) . 

ilm  — r  “  n  cos  {pi  >  as- 


Proof :  Apply  (3.7).  a 

This  explains  the  convergence  to  straight  horizontal  lines 
observed  in  the  figures.  It  follows  that  for  Gaussian  white  noise 
the  straight  lines  are  at  levels  i  cos  1(-k/(k+l))<  k*0,l,2,*** 

When  both  the  order  and  the  size  increase  we  have  convergence 
towards  the  highest  possible  frequency  in  the  spectrum  if  the  later 
exists.  More  precisely  we  have.  Define 

^  1  N 

Dj(N)  .  -Y  ■ 
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Also,  the  normalized  spectral  measure  of  (V^Zt)  is  given  by 


v  j { dw ) 


|  sin  -  g)j  dF(cj) 
Sn  fsin  |  xl  dF(X ) 

—  71  ^ 


where  F  is  the  spectral  distribution  function  of  {Z^).  Let  6^ 
be  the  point  mass  at  u. 

00 

Theorem  3.3.4.  Let  (Z  )ts>1  be  a  zero  mean  stationary  Gaussian 

* 

process,  and  suppose  g>  s  n  is  the  highest  frequency  in  the 

_  • 

spectral  support.  If  11m  D,(N)  <  (j  /n  with  probability  one, 

N-*»  J 


j  =  1 , 2  ,  • ■ • ,  then 


lim  lim  D  .  ( N )  =  —  a.s. 
j-*oo  U~tao  J  n 


Proof.  From  Kedem  and  Slud  (1982)  .  ■*  I  6  *  +  r  6  *,  l-»<v. 

-  '  j  2  -w  2  w  J 

But  the  Gaussian  assumption  implies  that 

rn 

cos  (rrEDj  ( N)  )  =  j  cos(w)^j(dw)  — ►  cos(cj  ),  j  — >00. 

-n 

_  * 

Therefore  EDj(N)  — »  w  /n ,  j  — 1  oo,  uniformly  in  N. 

By  strict  statlonarity  we  can  define  the  a.s.  limit 

c.  =  lim  6 . ( N)  . 

J  N-kjd  J 

* 

Then  by  Lemma  1  c.  <  c.  a.s.,  and  so  lim  c.  5  c  w  /n  a.s. 

J  J  +  1  j-*B  J 

Thus  by  bounded  convergence  E(Cj)  — ►  E(c),  j  — *  ®.  Strict 
statlonarity  and  Fatou's  Lemma  yield 


E(c.)  2  lim  sup  ED.(N) 
J  N-*»  J 


Since  Ec,  — >  Ec  and  ED.,(N)  — ►  w  /n , 


E  (  c )  ?  o>  /n  . 


*  * 

But  c  *  u  /n .  Therefore  c  =  u  /it  a.s. 


Corollary  3.3.1.  If  <j  =  n  then 


11m  lim  D  .  (N)  =  1 . 
j-tOD  N-*00  ^ 


We  close  this  section  by  noting  that  Theorem  3.3.2  which 
addresses  the  vertical  convergence  is  somewhat  more  general  than 
the  Higher  Order  Crossings  Theorem  proved  in  Kedem  and  Slud  (1982) 
and  which  says  that  for  a  strictly  stationary  process  with 
spectral  distribution  function  F  supported  at  n,  as  k  — ►  ao. 
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Theorem  3.3.2  does  not  presuppose  that  (Z^.)  possesses  moments  at 


all,  and  strictly  speaking,  it  is  not  a  spectral  result  but  can  be 
viewed  as  such  in  a  generalized  sense.  It  is  important  to  observe 
that  a  process  need  not  possess  moments  of  any  order  and  yet  its 
HOC-processes  possess  moments  of  all  orders. 
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4.0  ESTIMATION  OF  VARIANCE  OF  D-PROCESSES 
4.1  Direct  Computation 

In  order  to  use  zero-crossing  statistics  for  testing,  one 
needs  at  least  variance  estimates  for  the  These  estimates 

are  in  general  functions  of  fourth  order  orthant  probabilities  in 
which  are  not  easily  obtainable. 

In  addition  calculation  of  the  fourth  order  orthants  leads  to 
a  difficult  computational  problem  since  the  error  appears  to 
increase  rapidly  with  N. 

In  what  follows  we  discuss  in  some  detail  the  problem  of 
approximating  the  variance  of  Fortunately,  it  is  possible 

to  obtain  a  very  reasonable  approximation  under  a  certain 


condition. 


4.2  Simulation 


As  an  alternative  to  direct  computation  of  the  variance, 
approximations  to  the  variance  of  ^  can  be  obtained  from 

computer  simulations.  The  computer  is  programmed  to  generate 
realizations  of  a  specified  process  from  which  counts  of  the 
higher  order  zero-crossings  are  made.  Counts  are  obtained  for  a 
number  of  realizations,  and  the  sample  mean  and  variance  of 
these  statistics  are  calculated. 

In  our  study,  we  have  used  computer  simulation  results  as  a 
comparison  for  variance  estimates  obtained  by  analytical  methods. 
We  have  implemented  these  simulations  both  on  a  large  mainframe 
computer  (UNIVAC  1108)  using  FORTRAN,  and  on  an  IBM  personal 
computer  using  TurboPASCAL.  We  note  that  while  the  execution  time 
is  considerably  faster  on  the  mainframe,  running  time  on  a  PC  is 
not  prohibitively  slow  (approximately  20  minutes  for  100  itera¬ 
tions  with  1000  observations  and  six  levels  of  higher-order 
crossings)  and  offers  some  advantages  of  flexibility  and  lower 
cost . 

Some  examples  of  simulation  results  are  shown  in  Tables 
4- 1 A , B , C  and  4-2A,B,C.  The  results  agree  with  an  earlier 
simulation  reported  in  Kedem  (1987) 
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TABLE  4-1A 


Simulation  Results  for 

White  Noise  Process 
(100  iterations) 


N 

*  D1,N  (™r 

1/2 

10 

4.71 

1 . 54 

20 

9 . 96 

2.03 

50 

24.93 

3.46 

100 

49 .61 

4.92 

200 

99.46 

7.04 

500 

249 . 76 

10.16 

1000 

499 . 15 

15 . 28 

TABLE  4- IB 

Simulation  Results  for  D1 

,  N 

AR(  1 )  Process  <p*0. 5 

( 100  iterations) 

N 

*  °1,N  (H 

[».-.)] 

1/2 

10 

3 . 65 

1 .44 

20 

6 . 08 

2 . 55 

50 

16 . 52 

3 . 53 

100 

32 . 77 

4.89 

200 

67 .43 

6.88 

500 

167.56 

11.64 

1000 

336.89 

15.91 

.N  A  A  » A  A  A  A  w‘* 


.A  A  A  .  .  V  A  .V. 


•VAva.\V 


TABLE  4-2A 


Simulation  Results  for  N 

White  Noise  Process 
( N=1000 ,  100  iterations) 


ED... 
k ,  N 

Mv»)J 

1 

500 . 16 

16.23 

2 

668 . 00 

12.05 

3 

732 . 26 

11.86 

4 

768 . 49 

11.24 

5 

793 . 20 

10.93 

6 

811.74 

10.64 

TABLE  4-2B 


Simulation  Results  for 


D 


k ,  N 


AR ( 1 )  Process  <(>=0.5 
( N=1000 ,  100  iterations) 


k 


E  D 


k ,  N 


1/2 


1 

335 . 16 

16.58 

2 

578.87 

13 . 54 

3 

683 . 52 

13.94 

4 

739 . 10 

12.00 

5 

773.83 

11.64 

6 

797 . 95 

10.64 

4.3  Approximation  Assuming  Negligible  Comulants 


For  any  stationary  (Z^)  (not  necessarily  Gaussian)  with 


known  mean  // ,  define 


ch  -  c-h 


=  T^h  2 


h  =  0  ,  1 , • • •  , T-  1 


which  is  an  unbiased  estimate  of  / 


It  is  shown  in  Anderson  (1971,  p.452)  that 


T-h|  var  c  -  T  I  '  (l  *  'rth  'r-h 

r=- ( T-h- 1 ) 


+  h  ( h ,  -  r ,  h- r )  ] 


where  x  is  the  cumulant  function 


jr  (  h  ,  r  ,  s  )  =  E  {  (  Z  t  -  // )  (Z  t+h~/i)  (Z  t  +  r~/i)  (Zt+S'/')) 

-  /  r_s  +  /  r  '  h_s  *  '  s  '  h-r] 


t  =  •••,  1,  0,  1, 


When  (Z^.)  is  Gaussian  x  vanishes. 
We  observe  that 


d  =  y  d 
1'N  i-2  1 


..  V  fyd)  V(D] 

-  £  (xi  -  xi-ll 


N  ( 

=  E  * 

i  =  2  L 


2  N 


i  [x!1;]  -  2  %  x(i)  x< 

i  =  2  1 


y  fx(1)  -  1/2)  f x ( 1 !  -  1/2]  +  ( n-  1 


N-  1 

2 


SB  wjvjir.vi 


-  (N-l)  2C 


X '  '  (  1  ) 

where  is  the  function  (4.1)  for  the  X  -process.  Thus, 


D  i  N  i  r  y  (  1  )  'i 

Hft  -  1  °?  ]  • 


4.3) 


If  we  can  assume  that  the  cumuiants  are  negligible  for  the 
processes  we  are  interested  in,  then  (4.2)  is  easy  to  evaluate  and 
can  be  used  to  give  estimates  of  the  variance  of  N  by  using 

(4.3)  . 

We  calculated  estimates  of  the  variance  of  L)  for 

1  ,  N 

Gaussian  White  noise  and  for  Gaussian  AR ( 1 )  processes  with  para¬ 
meters  0.5  and  0.2.  The  results  are  shown  in  Table  4-3.  For 
white  Gaussian  noise  the  cumuiants  are  0,  so  (4.2)  works  exactly 
for  this  process.  However,  the  estimates  for  the  AR(1)  processes 
appear  to  overestimate  the  variance  compared  to  the  simulation 
results  in  Table  4-2B,C,  and  the  differences  are  larger  as  the 
AR(1)  process  moves  away  from  white  noise.  Thus  we  conclude  that 
this  method  does  not  provide  good  variance  estimates  for  processes 
other  than  white  noise.  In  other  words,  the  cumulant  function 
contribution  to  (4.2)  is  appreciable. 
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TABLE  4-3 

Results  of  Approximation  Based  on  the  modified 
Expression  (4.2)  For  (Var(D  ))1/2 


Process 


N 

White  Noise 

AR  (  1  ) 
0=0.5 

AR  (  I  ) 
0=0.2 

10 

1  .  50 

1  .  95 

1 . 57 

20 

2.18 

2 . 88 

2 . 28 

50 

3 . 50 

4 . 66 

3 .67 

100 

4 .97 

6 . 64 

5.22 

200 

7 . 05 

9.44 

7.41 

500 

11.17 

14.95 

11.73 

1000 

15.00 

21.16 

16.60 

/•_  f 


4.4  Approximation  for  White  Noise 


Kedem  and  Reed  (1986)  showed  that  for  an  m-dependent 
stationary  Gaussain  process  (Zt)  with  n  in  the  spectral  support 
and  for  fixed  N 


where 


1  im  - 

k-*v  (N-l)i 


Var<Dk,N> 


(k! 


(lVk)) 


(4.4) 


(k! 


P(x 


■;k! 


i  > 


=  1/2  +  1 / h  sin 


■  -1  „ ( k- 1 ) 


1 


k=l  , 2 , 


Thus  approximation  to  the  variance  of  ^  is  provided  by 

Var(Dk  N)  =  (N-l)  »|k)  (l-*jk))  (4.5) 

This  method  is  incorporated  in  the  method  using  first  order 
Markov  approximation  (Section  4.5).  We  may  view  (4.5)  as  O'th 
order  approximation. 


4.5  Approximation  assuming  first  order  Markov  process. 

The  difficulties  in  the  computation  of  Var(D.  .. )  can  to  a 

K  ,  N 

large  extent  be  passed  by  assuming  that  the  binary  process  (djk'} 
t=C, » 1,* • ■ ,  is  a  Markov  chain  for  each  fixed  k.  This  idea  has 
been  suggested  and  implemented  in  Kedem  (1987)  where  it  is  shown 
that  the  resulting  approximation  is  surprisingly  good.  The  method 
is  summarized  in  the  following  algorithm. 

Given  '’k+i  ’  we  can  c°mPute  E  ^  exactly  and 

approximate  Var  ( ^)  by  following  steps  17. 

Step  J_.  Obtain  />  j k  ^  from  (3.2). 

2  9 


Obtain  />  ^  from  (3.3). 


Compute 


t(k>  J  ♦  1  sin"1  .  j  =  1,2 

J  2  n  J 


Compute 


Step  5 .  Compute 


1  Vk>  .  q(k) 


i-2»;k)  ♦  .<k> 


2  (  1  -  A  j  k  ]  ) 


The  exact  expected  value  of  is  given  by 


E  Dk<N  =  ( N -  1  )  p 


The  variance  of  is  approximated  by 


(kl  (k,  ,p«k»,'k»(.-'k'  p1"', 

Var(Dk  „)  =  (N-1)pl  ><,'  * - '--no- - 


( N-  1  )  -V 


k ,  N  | 


(4.6) 


where 


VkV-olkyk,)^l)H;1] 

\.N 

The  variance  expression  (4.6)  can  be  recognized  as  the 


variance  of 


4”'  * 


under  the  assumption  that  (d^  )  is  a  two  state  stationary 


Markov  chain  with  parameters  p^kl  and  t  *k^ 


Observe  that  (4.6) 


is  (4.5)  plus  a  "correction"  term.  Tables  4 . 4A  -  4 . 4D  show  some 
computations  using  formula  (4.6).  These  results  agree  closely 


with  simulation  results.  Subroutine  DVAR  attached  at  the  end, 


uses  the  above  algorithm  in  the  computation  of  (4.6). 


TABLE  4  -  4 A 

Results  of  First  Order  Markov 
Approximation  for  White  Noise 
(N  =  1000) 


1 

499.50 

15.80 

2 

666 . 00 

13.14 

3 

731 .55 

12  . 

.  17 

4 

769  .  18 

11.58 

5 

794 . 37 

1  1  , 

.  1  6 

6 

812.76 

10.82 

TABLE  4  -  4 B 

Results  of  First  Order  Markov 
Approximation  for  AR(1)  0  ~  0.5 
(N  =  1000) 


1 

333 . 00 

15.76 

2 

579 . 85 

13.99 

3 

684 . 69 

12.74 

4 

729 . b7 

12.00 

5 

773.82 

1  1 

.  49 

6 

797.44 

1  1 

.  10 

1  1 
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TABLE  4-4C 

Results  of  First  Order  Markov 
Approximation  for  AR(1)  </>  =  0 . 2 
(N  =  1000) 


/•  " 

Var  D.  „ 
[  k ,  N 


435.47 
630 . 36 
709 .85 
754 .45 
783.61 

804 . 48 


15.82 

13.41 

12.39 

11.77 

11.33 

10.97 


TABLE  4 -4D 

Results  of  First  Order  Markov 
Approximation  for  AR(1)  0  =  -0.8 
(N  =  1000) 


Var(Dk.N 


794 . 37 
855.58 
876.17 
887.93 

895 . 97 

901.98 


14.74 
11.16 
10. 10 
9 . 53 
9.16 
8 . 89 


«j 
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5.  Testing  for  white  noise:  an  application  of  HOC  processes 

Finally,  after  discussing  the  dynamic  behavior  of  HOC- 
processes  in  regard  to  order  and  time,  we  are  ready  to  apply  this 
interesting  dynamics  in  tests  appropriate  for  time  series  models. 

We  concentrate  on  testing  for  white  noise  but  the  technique  can  be 
readily  extended  to  more  general  tests  regarding  autoregressive- 
moving  average  processes. 

As  noted  earlier,  HOC  processes  summarize  the  oscillatory 
characteristics  of  a  process.  Moreover,  it  is  apparent  from 
Figures,  3. 1-3. 2  that  for  stationary  time  series,  HOC-processes 
produce  sample  paths  which  settle  about  straight  lines  even  for 
moderate  sample  sizes,  and  that  the  initial  rate  of  the  vertical 
convergence  is  rather  fast  which  is  a  general  property  of  higher 
order  crossings.  Thus,  in  this  sense,  it  is  possible  to  characte¬ 
rize  a  process  by  a  few  HOC-processes  observed  between,  say, 

N  =  500  to  N  =  1000  Experience  shows,  and  this  will  be  illustrated 
again  below,  that  as  few  as  six  HOC-processes  need  to  be  considered. 
Based  on  these  observations,  the  proposed  test  for  white  noise  is 
to  observe  whether  the  first  six  HOC-processes  produce  sample  paths 
which  indeed  fall  within  prescribed  probability  bounds  derived 
under  the  hypothesis  of  white  noise.  The  hypothesis  of  white  noise 
is  rejected  as  soon  as  at  least  one  of  the  six  paths  cross  its 
probability  bounds.  Such  a  scheme  is  akin  to  control  processes 
and  can  be  used  in  real  time  monitoring  of  man  made  systems. 

Observe  that  under  the  Markov  assumption,  has  an 

asymptotic  normal  distribution  so  that  it  lies  in 


E  Dk_„  ■  1.96  /v»r<Dk_N> 


(5.1) 


with  95%  approximately,  for  each  fixed  k  and  large  N.  However 

as  N  varies  we  must  obtain  a  simultaneous  bound  for  the  HOC  path 

{D,  .,/N),  N,  *•  N  N„.  In  this  work  we  take  N.  =  500,  N  =1000, 

K ,  N  1  2  1  2 

!■ k'6,  The  bounds  are  determined  from 


c/Var  (  D.  1,  500-  N-  1000 

k  ,  N 


(5.2 


for  each  k,  l-k-6,  where  (Var(Dk  ^ ) } 


i s  given  by  (4.6) 


assuming  white  noise.  The  constant  c  controls  the  overall  power 
and  the  o-level  of  the  test.  When  c=1.96  the  test  is  very 
powerful  against  many  alternatives  but  also  has  a  high  a- level. 

In  order  to  lower  the  level  we  widen  the  bounds  by  choosing  larger 
values  for  c,  a  procedure  that  leads  to  more  conservative  bounds. 


5.1  Power  considerations  and  examples 

Simulations  were  first  run  to  estimate  the  power  of  the  test 
for  white  noise  as  a  function  of  c,  the  test  width  coefficient  in 
(5.2).  Three  processes  were  simulated:  white  noise  and  two  AR ( 1 ) 
processes  with  parameters  0.05  and  0.1.  The  resulting  HOC-process 
paths  were  monitored  using  the  white  noise  limits  obtained  from 
(4.6)  and  (5.2).  Figures  5. 1-5.3  show  the  percentage  of  cases 
failing  the  test  for  the  three  simulations,. 

From  Figure  5.1  it  appears  that,  a  c-value  close  to  3  corres- 
pons  to  a  level  around  5%.  In  fact,  we  obtained  an  it- level  of 
0.06  for  C=2.8.  The  power  corresponding  to  c  =  2.8  is  about  20% 
and  40%  for  0=0.05  and  0=0.1,  respectively.  This  is  shown  in 
Figures  5.2,  5.3.  The  curves  shown  in  the  three  figures  were 
obtained  by  a  least -squares  fit. 


Table  5.1  gives  the  results  of  some  power  calculations 
obtained  from  100  simulations  for  each  alternative  from  ARMA(1,1). 
The  value  of  c  is  2.8.  The  further  the  alternative  is  from 
white  noise  the  higher  is  the  power  as  expected. 

Examples  which  illustrate  the  use  of  the  test  procedure  are 
shown  in  Figures  5. 4-5. 6.  For  non-white  noise  in  many  cases 
higher  order  HOC-processes  resemble  those  from  white  noise  and  the 
distinction  is  made  by  low  order  HOC-processes.  Occasionally  it 
is  the  D6-path  that  captures  the  difference  while  the  lower  order 
cases  do  not. 
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0 
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0 
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0 

15 

0 

00 

0 
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0 

20 

0 

00 

0 
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0 

10 
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10 

0 

97 

0 
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0 

20 
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00 

0 

20 

0 

20 
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00 

0 

05 

0 

30 

1 

00 

Table  5.1.  Power  for  the  white  noise  test  against  alternatives 

■ 

i  from  Z  =  0Z  f  u  -  Ou  where  u  are  independent  N(0,1) 

j  t  t  t  t  —  i  t 


random  variables. 


Figure  5.5  Hoc-processes  from  AR(1),  0=0.1,  and  white  noise 
limits.  The  hypothesis  of  white  noise  is  rejected. 


5,2  Identification  of  general  models. 

There  is  no  reason  why  the  above  procedure  cannot  be  extended 
to  test  for  non-white  models,  because  once  a  model  is  completely 
specified  and  its  correlation  structure  is  known  as  a  result, 

(4.6)  and  (5.2)  can  be  evaluated  and  the  limits  can  be  obtained  as 
before.  Our  experimental  results  indicate  that  to  maintain  a 
reasonable  test  level  the  value  of  2.8  for  c  in  (5.2)  is  still  a 
good  choice.  Table  5.2  gives  some  power  calculation  for  testing 
the  hypothesis  that  a  process  is  AR  (  1  )  with  parameter  <7>  -  0.1, 
where  the  alternatives  are  generated  by  ARMA ( 1 , 1 )  with  parameters 
0,0  as  above. 
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Table  5.2.  Power  of  the  test  for  testing  the  hypothesis  that  the 
process  is  AR ( 1 )  with  0  -  0.1.  The  alternatives  are  generated  by 


different  ARMA(1,1)  processes. 


6 .  Summary 

We  have  outlined  a  test  procedure  for  time  series  models 
based  on  HOC-processes.  The  test  can  be  used  in  online  situations, 
for  example  when  monitoring  the  oscillation  of  signatures  obtained 
from  man  made  systems  such  as  engines.  As  was  indicated,  if 
higher  a-levels  can  be  tolerated,  the  power  against  different 
alternatives  can  increase  appreciably.  This  can  be  achieved  by 
lowering  c  and  thus  rendering  the  limits  less  conservative. 

The  ideas  presented  in  this  paper  can  be-  extended  to  cover 
nonstationary  time  series  as  well  since  the  vertical  convergence 
is  quite  a  general  phenomenon  and  seems  to  hold  across  a  wide 
range  of  processes. 

The  present  paper  is  a  generalization,  an  application  of  HOC 

to  one  based  on  HOC-processes.  For  each  fixed  N  the  sequence 

{ ,  k  =  1  , 2 , ‘ •  ,  is  the  sequence  of  higher  order  crossings . 

It  is  known  that  the  rate  of  increase  in  HOC,  and  hence  also  In 

HOC-processes,  is  a  useful  feature  in  discriminating  between 

processes  as  is  exemplified  in  Figures  3.  1-3.3.  Tests  based  on 

{D.  ..  I  for  fixed  N  are  discussed  in  Kedem  (1987).  It 

1  ,  IN  b  ,  N 

seems  that  no  real  advantage  is  gained  by  increasing  the  order  k 
in  the  present  use  of  HOC-processes  amidst  stationary  time  series. 
On  the  other  hand  by  increasing  N  we  can  obtain  a  more  powerful 
test.  This  pertains  particularly  to  borderline  cases  that  need 
"more  time"  to  get  out  of  the  probability  bounds  as  they  become 
narrower  and  narrower. 

Finally,  the  method  described  herein  impinges  on  sequential 
analysis.  This  connection  will  be  pursued  elsewhere. 
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